children and adolescents more than adults and it is associated with non-penetrating injuries to bones. Infection typically
occurs in the long end of growing bone, hence its occurrence in physically immature populations. Most often, infection
is localized in the vicinity of sprouting capillary loops adjacent to epiphysial growth plates in the end of long, growing
bones.


Skin infections 

S. aureus is the most common pathogen of such minor skin infections as abscesses and boils. Such infections 
often are resolved by normal host response mechanisms, but they also can develop into severe internal infections. 
Recurrent infections of the nasal passages plague nasal carriers of S. aureus.

Surgical Wound Infections 

Surgical wounds often penetrate far into the body. Infection of such wounds thus poses a grave risk to the patient.
S. aureus is the most important causative agent of infections in surgical wounds. S. aureus is unusually adept at
invading surgical wounds; sutured wounds can be infected by far fewer S. aureus cells than are necessary to cause
infection in normal skin. Invasion of a surgical wound can lead to severe S. aureus septicaemia. Invasion of the blood
stream by S. aureus can lead to seeding and infection of internal organs, particularly heart valves and bone, causing
systemic diseases, such as endocarditis and osteomyelitis.


Scalded Skin Syndrome 

S. aureus is responsible for 'scalded skin syndrome' (also called toxic epidermal necrosis, Ritter's disease and
Lyell's disease). This disease occurs in older children, typically in outbreaks caused by flowering of S. aureus strains
that produce exfoliatin (also called scalded skin syndrome toxin). Although the bacteria initially may infect only a minor
lesion, the toxin destroys intercellular connections, spreads epidermal layers and allows the infection to penetrate the
outer layer of the skin, producing the desquamation that typifies the disease. Shedding of the outer layer of skin
generally reveals normal skin below, but fluid lost in the process can produce severe injury in young children if it is not
treated properly.


Toxic Shock Syndrome 

Toxic shock syndrome is caused by strains of S. aureus that produce the so-called toxic shock syndrome toxin.
The disease can be caused by S. aureus infection at any site, but it is too often erroneously viewed as a
disease solely of women who use tampons. The disease involves toxaemia and septicaemia, and can be fatal.

Nosocomial Infections

In the 1984 National Nosocomial Infection Surveillance Study ("NNIS") S. aureus was the most prevalent agent
of surgical wound infections in many hospital services, including medicine, surgery, obstetrics, pediatrics and newborns.

Resistance to drugs of S. aureus strains 

Prior to the introduction of penicillin the prognosis for patients seriously infected with S. aureus was unfavorable.
Following the introduction of penicillin in the early 1940s even the worst S. aureus infections generally could be treated
successfully. The emergence of penicillin-resistant strains of S. aureus did not take long, however. Most strains of S.
aureus encountered in hospital infections today do not respond to penicillin, although, fortunately, this is not the case
for S. aureus encountered in community infections.

It is well known now that penicillin-resistant strains of S. aureus produce a lactamase which converts penicillin to
penicilloic acid, and thereby destroys antibiotic activity. Furthermore, the lactamase gene often is propagated
episomally, typically on a plasmid, and often is only one of several genes on an episomal element that, together, confer
multidrug resistance.

Methicillins, introduced in the 1950s, largely overcame the problem of penicillin resistance in S. aureus. These
compounds conserve the portions of penicillin responsible for antibiotic activity and modify or alter other portions that
make penicillin a good substrate for inactivating lactamases. However, methicillin resistance has emerged in S. aureus,
along with resistance to many other antibiotics effective against this organism, including aminoglycosides, tetracycline,
chloramphenicol, macrolides and lincosamides. In fact, methicillin-resistant strains of S. aureus generally are multiply
drug resistant.

information herein described stored in a data storage means. Such systems are designed to identify commercially
important fragments of the Staphylococcus aureus genome.

Another embodiment of the present invention is directed to fragments, preferably isolated fragments, of the
Staphylococcus aureus genome having particular structural or functional attributes. Such fragments of the Staphylococcus
aureus genome of the present invention include, but are not limited to, fragments which encode peptides, hereinafter
referred to as open reading frames or "ORFs," fragments which modulate the expression of an operably linked ORF,
hereinafter referred to as expression modulating fragments or "EMFs," and fragments which can be used to diagnose
the presence of Staphylococcus aureus in a sample, hereinafter referred to as diagnostic fragments or "DFs."

Each of the ORFs in fragments of the Staphylococcus aureus genome disclosed in Tables 1-3, and the EMFs
found 5' to the ORFs, can be used in numerous ways as polynucleotide reagents. For instance, the sequences can be
used as diagnostic probes or amplification primers for detecting or determining the presence of a specific microbe in
a sample, to selectively control gene expression in a host and in the production of polypeptides, such as polypeptides
encoded by ORFs of the present invention, particularly those polypeptides that have a pharmacological activity.

The present invention further includes recombinant constructs comprising one or more fragments of the
Staphylococcus aureus genome of the present invention. The recombinant constructs of the present invention comprise
vectors, such as a plasmid or viral vector, into which a fragment of the Staphylococcus aureus genome has been inserted.

The present invention further provides host cells containing any of the isolated fragments of the Staphylococcus
aureus genome of the present invention. The host cells can be a higher eukaryotic host cell, such as a mammalian
cell, a lower eukaryotic cell, such as a yeast cell, or a procaryotic cell such as a bacterial cell.

The present invention is further directed to polypeptides and proteins, preferably isolated polypeptides and
proteins, encoded by ORFs of the present invention. A variety of methods, well known to those of skill in the art, routinely
may be utilized to obtain any of the polypeptides and proteins of the present invention. For instance, polypeptides and
proteins of the present invention having relatively short, simple amino acid sequences readily can be synthesized using
commercially available automated peptide synthesizers. Polypeptides and proteins of the present invention also may
be purified from bacterial cells which naturally produce the protein. Yet another alternative is to purify polypeptides and
proteins of the present invention from cells which have been altered to express them.

The invention further provides polypeptides, preferably isolated polypeptides, comprising Staphylococcus aureus
epitopes and vaccine compositions comprising such polypeptides. Also provided are methods for vaccinating an
individual against Staphylococcus aureus infection.

The invention further provides methods of obtaining homologs of the fragments of the Staphylococcus aureus
genome of the present invention and homologs of the proteins encoded by the ORFs of the present invention.
Specifically, by using the nucleotide and amino acid sequences disclosed herein as a probe or as primers, and techniques
such as PCR cloning and colony/plaque hybridization, one skilled in the art can obtain homologs.

The invention further provides antibodies which selectively bind polypeptides and proteins of the present invention.
Such antibodies include both monoclonal and polyclonal antibodies.

The invention further provides hybridomas which produce the above-described antibodies. A hybridoma is an
immortalized cell line which is capable of secreting a specific monoclonal antibody.

The present invention further provides methods of identifying test samples derived from cells which express one
of the ORFs of the present invention, or a homolog thereof. Such methods comprise incubating a test sample with one
or more of the antibodies of the present invention, or one or more of the DFs or antigens of the present invention, under
conditions which allow a skilled artisan to determine if the sample contains the ORF or product produced therefrom.

In another embodiment of the present invention, kits are provided which contain the necessary reagents to carry 
out the above-described assays. 

Specifically, the invention provides a compartmentalized kit to receive, in close confinement, one or more containers,
which comprises: (a) a first container comprising one of the antibodies, antigens, or one of the DFs of the present
invention; and (b) one or more other containers comprising one or more of the following: wash reagents, reagents
capable of detecting presence of bound antibodies, antigens or hybridized DFs.

Using the isolated proteins of the present invention, the present invention further provides methods of obtaining
and identifying agents capable of binding to a polypeptide or protein encoded by one of the ORFs of the present
invention. Specifically, such agents include, as further described below, antibodies, peptides, carbohydrates,
pharmaceutical agents and the like. Such methods comprise steps of: (a) contacting an agent with an isolated protein
encoded by one of the ORFs of the present invention; and (b) determining whether the agent binds to said protein.

The present genomic sequences of Staphylococcus aureus will be of great value to all laboratories working with
this organism and for a variety of commercial purposes. Many fragments of the Staphylococcus aureus genome will
be immediately identified by similarity searches against GenBank or protein databases and will be of immediate value
to Staphylococcus aureus researchers and of immediate commercial value for the production of proteins or to control
gene expression.

The methodology and technology for elucidating extensive genomic sequences of bacterial and other genomes

libraries and for sequencing are provided below, for instance. A wide variety of Staphylococcus aureus strains that can
be used to prepare S. aureus genomic DNA for cloning and for obtaining polynucleotides of the present invention are
available to the public from recognized depository institutions, such as the American Type Culture Collection (ATCC).

The nucleotide sequences of the genomes from different strains of Staphylococcus aureus differ somewhat.
However, the nucleotide sequences of the genomes of all Staphylococcus aureus strains will be at least 95% identical, in
corresponding part, to the nucleotide sequences provided in SEQ ID NOS:1-5,191. Nearly all will be at least 99%
identical and the great majority will be 99.9% identical.

Thus, the present invention further provides nucleotide sequences which are at least 95%, preferably 99% and
most preferably 99.9% identical to the nucleotide sequences of SEQ ID NOS:1-5,191, in a form which can be readily
used, analyzed and interpreted by the skilled artisan.

Methods for determining whether a nucleotide sequence is at least 95%, at least 99% or at least 99.9% identical
to the nucleotide sequences of SEQ ID NOS:1-5,191 are routine and readily available to the skilled artisan. For example,
the well known FASTA algorithm described in Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85: 2444 (1988) can be
used to generate the percent identity of nucleotide sequences. The BLASTN program also can be used to generate
an identity score of polynucleotides compared to one another.
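
The percent identity thresholds discussed above can be illustrated with a minimal Python sketch. It is not the FASTA or BLASTN algorithm: it assumes the two sequences have already been aligned by such a tool (gaps written as '-') and only counts matching columns. The function names `percent_identity` and `meets_threshold`, and the choice to count a gap against a base as a mismatch, are assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of two pre-aligned sequences.

    Assumption: a gap ('-') opposite a base counts as a mismatch, and
    columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # column carries no information
        columns += 1
        if x == y:
            matches += 1
    if columns == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / columns


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 95.0) -> bool:
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Calling `meets_threshold(a, b, 99.9)` would apply the most stringent of the three identity levels named in the text; in practice the alignment step itself would be delegated to FASTA or BLASTN.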

COMPUTER RELATED EMBODIMENTS 

The nucleotide sequences provided in SEQ ID NOS:1-5,191, a representative fragment thereof, or a nucleotide
sequence at least 95%, preferably at least 99% and most preferably at least 99.9% identical to a polynucleotide
sequence of SEQ ID NOS:1-5,191 may be "provided" in a variety of mediums to facilitate use thereof. As used herein,
"provided" refers to a manufacture, other than an isolated nucleic acid molecule, which contains a nucleotide sequence
of the present invention; i.e., a nucleotide sequence provided in SEQ ID NOS:1-5,191, a representative fragment
thereof, or a nucleotide sequence at least 95%, preferably at least 99% and most preferably at least 99.9% identical
to a polynucleotide of SEQ ID NOS:1-5,191. Such a manufacture provides a large portion of the Staphylococcus aureus
genome and parts thereof (e.g., a Staphylococcus aureus open reading frame (ORF)) in a form which allows a skilled
artisan to examine the manufacture using means not directly applicable to examining the Staphylococcus aureus
genome or a subset thereof as it exists in nature or in purified form.

In one application of this embodiment, a nucleotide sequence of the present invention can be recorded on computer
readable media. As used herein, "computer readable media" refers to any medium which can be read and accessed
directly by a computer. Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard
disc storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as
RAM and ROM; and hybrids of these categories, such as magnetic/optical storage media. A skilled artisan can readily
appreciate how any of the presently known computer readable mediums can be used to create a manufacture
comprising computer readable medium having recorded thereon a nucleotide sequence of the present invention. Likewise,
it will be clear to those of skill how additional computer readable media that may be developed also can be used to
create analogous manufactures having recorded thereon a nucleotide sequence of the present invention.

As used herein, "recorded" refers to a process for storing information on computer readable medium. A skilled
artisan can readily adopt any of the presently known methods for recording information on computer readable medium
to generate manufactures comprising the nucleotide sequence information of the present invention.

A variety of data storage structures are available to a skilled artisan for creating a computer readable medium 
having recorded thereon a nucleotide sequence of the present invention. The choice of the data storage structure will 
generally be based on the means chosen to access the stored information. In addition, a variety of data processor 
programs and formats can be used to store the nucleotide sequence information of the present invention on computer 
readable medium. The sequence information can be represented in a word processing text file, formatted in
commercially available software such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored
in a database application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any number of
data-processor structuring formats (e.g., text file or database) in order to obtain computer readable medium having 
recorded thereon the nucleotide sequence information of the present invention. 
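
As one illustration of recording sequence information in an ASCII file, the following Python sketch writes named sequences to a plain text file and reads them back. The FASTA-style layout (a '>' header line followed by wrapped sequence lines) is an assumption chosen for the example; the text above does not prescribe a particular file layout. The function names and the record names used with them are hypothetical.

```python
import os
import tempfile
from typing import Dict


def write_fasta(records: Dict[str, str], path: str, width: int = 60) -> None:
    """Write named sequences to an ASCII text file, one '>' header per record."""
    with open(path, "w", encoding="ascii") as handle:
        for name, seq in records.items():
            handle.write(">" + name + "\n")
            # Wrap the sequence at a fixed width for readability.
            for i in range(0, len(seq), width):
                handle.write(seq[i:i + width] + "\n")


def read_fasta(path: str) -> Dict[str, str]:
    """Read the file written by write_fasta back into a name -> sequence dict."""
    records: Dict[str, str] = {}
    name = None
    with open(path, "r", encoding="ascii") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                name = line[1:]
                records[name] = ""
            elif name is not None:
                records[name] += line
    return records


def round_trip(records: Dict[str, str]) -> Dict[str, str]:
    """Record sequences in a temporary file and read them back."""
    fd, path = tempfile.mkstemp(suffix=".fasta")
    os.close(fd)
    try:
        write_fasta(records, path)
        return read_fasta(path)
    finally:
        os.remove(path)
```

The same records could equally be stored as rows in a database table; the text file form is used here only because it needs nothing beyond the standard library.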

Computer software is publicly available which allows a skilled artisan to access sequence information provided in 
a computer readable medium. Thus, by providing in computer readable form the nucleotide sequences of SEQ ID 
NOS:1-5,191, a representative fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% and
most preferably at least 99.9% identical to a sequence of SEQ ID NOS:1-5,191, the present invention enables the
skilled artisan routinely to access the provided sequence information for a wide variety of purposes.

The examples which follow demonstrate how software which implements the BLAST (Altschul et al., J. Mol. Biol.
215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase
system was used to identify open reading frames (ORFs) within the Staphylococcus aureus genome which contain
homology to ORFs or proteins from both Staphylococcus aureus and from other organisms. Among the ORFs discussed

accessing and processing the genomic sequence (such as search tools, comparing tools, etc.) reside in main memory
108, in accordance with the requirements and operating parameters of the operating system, the hardware system
and the software program or programs.

BIOCHEMICAL EMBODIMENTS

Other embodiments of the present invention are directed to fragments of the Staphylococcus aureus genome,
preferably to isolated fragments. The fragments of the Staphylococcus aureus genome of the present invention include,
but are not limited to, fragments which encode peptides, hereinafter open reading frames (ORFs), fragments which
modulate the expression of an operably linked ORF, hereinafter expression modulating fragments (EMFs), and
fragments which can be used to diagnose the presence of Staphylococcus aureus in a sample, hereinafter diagnostic
fragments (DFs).

As used herein, an "isolated nucleic acid molecule" or an "isolated fragment of the Staphylococcus aureus genome"
refers to a nucleic acid molecule possessing a specific nucleotide sequence which has been subjected to purification
means to reduce, from the composition, the number of compounds which are normally associated with the composition.
Particularly, the term refers to the nucleic acid molecules having the sequences set out in SEQ ID NOS:1-5,191, to
representative fragments thereof as described above, and to polynucleotides at least 95%, preferably at least 99% and
especially preferably at least 99.9% identical in sequence thereto, also as set out above.

A variety of purification means can be used to generate the isolated fragments of the present invention. These
include, but are not limited to, methods which separate constituents of a solution based on charge, solubility, or size.

In one embodiment, Staphylococcus aureus DNA can be mechanically sheared to produce fragments of 15-20 kb
in length. These fragments can then be used to generate a Staphylococcus aureus library by inserting them into
lambda clones as described in the Examples below. Primers flanking, for example, an ORF, such as those enumerated
in Tables 1-3, can then be generated using nucleotide sequence information provided in SEQ ID NOS:1-5,191. Well
known and routine techniques of PCR cloning then can be used to isolate the ORF from the lambda DNA library of
Staphylococcus aureus genomic DNA. Thus, given the availability of SEQ ID NOS:1-5,191, the information in Tables
1, 2 and 3 and the information that may be obtained readily by analysis of the sequences of SEQ ID NOS:1-5,191
using methods set out above, those of skill will be enabled by the present disclosure to isolate any ORF-containing or
other nucleic acid fragment of the present invention.

The isolated nucleic acid molecules of the present invention include, but are not limited to, single stranded and
double stranded DNA, and single stranded RNA.

As used herein, an "open reading frame," ORF, means a series of triplets coding for amino acids without any
termination codons and is a sequence translatable into protein.
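
The definition above (a run of triplets with no termination codon) can be sketched in Python as follows. This is only an illustration of the definition, not the GeneMark method used to build the Tables: it scans a single strand in one reading frame and reports every stretch of codons uninterrupted by a stop codon. The function name `open_stretches`, the 0-based half-open coordinates, and the use of the standard stop codons TAA, TAG and TGA are assumptions of the sketch.

```python
from typing import List, Tuple

# Standard termination codons (DNA alphabet); assumed for this sketch.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def open_stretches(seq: str, frame: int = 0, min_codons: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) spans of codon runs that contain no stop codon.

    Coordinates are 0-based and half-open; `frame` is 0, 1 or 2 and is
    counted from the first 5' nucleotide of `seq`. Stop codons themselves
    are excluded from the reported spans.
    """
    if frame not in (0, 1, 2):
        raise ValueError("frame must be 0, 1 or 2")
    seq = seq.upper()
    spans = []
    start = frame
    pos = frame
    while pos + 3 <= len(seq):
        if seq[pos:pos + 3] in STOP_CODONS:
            if (pos - start) // 3 >= min_codons:
                spans.append((start, pos))
            start = pos + 3  # next run begins after the stop codon
        pos += 3
    # Trailing run that reaches the end of the sequence without a stop.
    if (pos - start) // 3 >= min_codons:
        spans.append((start, pos))
    return spans
```

Setting `min_codons=80` would mirror the 80 amino acid length cutoff mentioned for Table 1, and running the function on the reverse complement would cover the opposite strand.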

Tables 1, 2 and 3 list ORFs in the Staphylococcus aureus genomic contigs of the present invention that were
identified as putative coding regions by the GeneMark software using organism-specific second-order Markov
probability transition matrices. It will be appreciated that other criteria can be used, in accordance with well known analytical
methods, such as those discussed herein, to generate more inclusive, more restrictive or more selective lists.

Table 1 sets out ORFs in the Staphylococcus aureus contigs of the present invention that are at least 80 amino
acids long and that are, over a continuous region of at least 50 bases, 95% or more identical (by BLAST analysis) to
an S. aureus nucleotide sequence available through Genbank in November 1996.

Table 2 sets out ORFs in the Staphylococcus aureus contigs of the present invention that are not in Table 1 and
match, with a BLASTP probability score of 0.01 or less, a polypeptide sequence available through Genbank by
September 1995.

Table 3 sets out ORFs in the Staphylococcus aureus contigs of the present invention that do not match significantly,
by BLASTP analysis, a polypeptide sequence available through Genbank by September 1996.

In each table, the first and second columns identify the ORF by, respectively, contig number and ORF number
within the contig; the third column indicates the reading frame, taking the first 5' nucleotide of the contig as the start of
the +1 frame; the fourth column indicates the first nucleotide of the ORF, counting from the 5' end of the contig strand;
and the fifth column indicates the length of each ORF in nucleotides.

In Tables 1 and 2, column six lists the "Reference" for the closest matching sequence available through Genbank.
These reference numbers are the database entry numbers commonly used by those of skill in the art, who will be
familiar with their denominators. Descriptions of the nomenclature are available from the National Center for
Biotechnology Information. Column seven in Tables 1 and 2 provides the "gene name" of the matching sequence; column eight
provides the BLAST "identity" score from the comparison of the ORF and the homologous gene; and column nine
indicates the length in nucleotides of the "highest scoring segment pair" identified by the BLAST identity analysis.

In Table 3, the last column, column six, indicates the length of each ORF in amino acid residues.
The concepts of percent identity and percent similarity of two polypeptide sequences are well understood in the art.
For example, two polypeptides 10 amino acids in length which differ at three amino acid positions (e.g., at positions

Staphylococcus aureus, particularly those that distinguish medically important strains, such as drug-resistant strains.

In addition, the fragments of the present invention, as broadly described, can be used to control gene expression
through triple helix formation or antisense DNA or RNA, both of which methods are based on the binding of a
polynucleotide sequence to DNA or RNA. Triple helix formation optimally results in a shut-off of RNA transcription from DNA,
while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. Information from the
sequences of the present invention can be used to design antisense and triple helix-forming oligonucleotides.
Polynucleotides suitable for use in these methods are usually 20 to 40 bases in length and are designed to be complementary
to a region of the gene involved in transcription, for triple-helix formation, or to the mRNA itself, for antisense inhibition.
Both techniques have been demonstrated to be effective in model systems, and the requisite techniques are well known
and involve routine procedures. Triple helix techniques are discussed in, for example, Lee et al., Nucl. Acids Res. 5:
3073 (1979); Cooney et al., Science 241: 455 (1988); and Dervan et al., Science 251: 1350 (1991). Antisense
techniques in general are discussed in, for instance, Okano, J. Neurochem. 55: 560 (1991) and
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, CRC Press, Boca Raton, FL (1988).
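
The complementarity requirement for an antisense oligonucleotide can be sketched in Python as a reverse-complement operation over a chosen window of the target. This is a minimal illustration only: the function name `antisense_oligo`, the choice to return a DNA oligonucleotide written 5' to 3', and the acceptance of either U or T in the input are assumptions, and the sketch performs no check of melting temperature, secondary structure or specificity.

```python
# Watson-Crick complements, returning DNA bases; 'U' is accepted so that
# an mRNA target can be passed in directly.
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def antisense_oligo(target: str, start: int, length: int = 20) -> str:
    """Return a DNA oligo (5'->3') complementary to target[start:start+length].

    The 20 to 40 base length range follows the text above.
    """
    if not 20 <= length <= 40:
        raise ValueError("length should be between 20 and 40 bases")
    region = target.upper()[start:start + length]
    if start < 0 or len(region) != length:
        raise ValueError("requested window falls outside the target sequence")
    try:
        # Reverse the window and complement each base.
        return "".join(_COMPLEMENT[base] for base in reversed(region))
    except KeyError as exc:
        raise ValueError("unexpected character in target: %s" % exc) from None
```

For antisense inhibition the window would be taken from the mRNA itself; for triple-helix formation it would be taken from the region of the gene involved in transcription, as described above.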

The present invention further provides recombinant constructs comprising one or more fragments of the
Staphylococcus aureus genomic fragments and contigs of the present invention. Certain preferred recombinant constructs of
the present invention comprise a vector, such as a plasmid or viral vector, into which a fragment of the Staphylococcus
aureus genome has been inserted, in a forward or reverse orientation. In the case of a vector comprising one of the
ORFs of the present invention, the vector may further comprise regulatory sequences, including, for example, a
promoter, operably linked to the ORF. For vectors comprising the EMFs of the present invention, the vector may further
comprise a marker sequence or heterologous ORF operably linked to the EMF.

Large numbers of suitable vectors and promoters are known to those of skill in the art and are commercially
available for generating the recombinant constructs of the present invention. The following vectors are provided by
way of example. Useful bacterial vectors include phagescript, PsiX174, pBluescript SK and KS (+ and -), pNH8a,
pNH16a, pNH18a, pNH46a (available from Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (available
from Pharmacia). Useful eukaryotic vectors include pWLneo, pSV2cat, pOG44, pXT1, pSG (available from Stratagene);
pSVK3, pBPV, pMSG, pSVL (available from Pharmacia).

Promoter regions can be selected from any desired gene using CAT (chloramphenicol transferase) vectors or other
vectors with selectable markers. Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial
promoters include lacI, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV
thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate
vector and promoter is well within the level of ordinary skill in the art.

The present invention further provides host cells containing any one of the isolated fragments of the Staphylococcus
aureus genomic fragments and contigs of the present invention, wherein the fragment has been introduced into the
host cell using known methods. The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower
eukaryotic host cell, such as a yeast cell, or a procaryotic cell, such as a bacterial cell.

A polynucleotide of the present invention, such as a recombinant construct comprising an ORF of the present
invention, may be introduced into the host by a variety of well established techniques that are standard in the art, such
as calcium phosphate transfection, DEAE-dextran mediated transfection and electroporation, which are described in,
for instance, Davis, L. et al., BASIC METHODS IN MOLECULAR BIOLOGY (1986).

A host cell containing one of the fragments of the Staphylococcus aureus genomic fragments and contigs of the
present invention can be used in conventional manners to produce the gene product encoded by the isolated fragment
(in the case of an ORF) or can be used to produce a heterologous protein under the control of the EMF.

The present invention further provides isolated polypeptides encoded by the nucleic acid fragments of the present
invention or by degenerate variants of the nucleic acid fragments of the present invention. By "degenerate variant" is
intended nucleotide fragments which differ from a nucleic acid fragment of the present invention (e.g., an ORF) by
nucleotide sequence but, due to the degeneracy of the Genetic Code, encode an identical polypeptide sequence.
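
The notion of a degenerate variant can be checked mechanically: two different nucleotide sequences are degenerate variants of one another if they translate to the same polypeptide. The Python sketch below assumes the standard genetic code, built here from the conventional TCAG ordering; the function names `translate` and `is_degenerate_variant` are hypothetical, and an organism using a different translation table would need a different amino acid string.

```python
from itertools import product

# Standard genetic code in TCAG order ('*' marks a termination codon).
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence codon by codon (no stop handling)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def is_degenerate_variant(dna_a: str, dna_b: str) -> bool:
    """True if the sequences differ in nucleotides but encode the same polypeptide."""
    return dna_a.upper() != dna_b.upper() and translate(dna_a) == translate(dna_b)
```

For example, `ATGGCTTTA` and `ATGGCCCTG` differ at two positions yet both encode Met-Ala-Leu, so each is a degenerate variant of the other under this definition.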

Preferred nucleic acid fragments of the present invention are the ORFs depicted in Tables 2 and 3 which encode
proteins.

A variety of methodologies known in the art can be utilized to obtain any one of the isolated polypeptides or proteins
of the present invention. At the simplest level, the amino acid sequence can be synthesized using commercially
available peptide synthesizers. This is particularly useful in producing small peptides and fragments of larger polypeptides.
Such short fragments as may be obtained most readily by synthesis are useful, for example, in generating antibodies
against the native polypeptide, as discussed further below.

In an alternative method, the polypeptide or protein is purified from bacterial cells which naturally produce the
polypeptide or protein. One skilled in the art can readily employ well-known methods for isolating polypeptides and
proteins to isolate and purify polypeptides or proteins of the present invention produced naturally by a bacterial strain,
or by other methods. Methods for isolation and purification that can be employed in this regard include, but are not
limited to, immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, and immu-

may also be employed as a matter of choice.

As a representative but non-limiting example, useful expression vectors for bacterial use can comprise a selectable
marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements
of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3
(available from Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (available from Promega Biotec, Madison,
WI, USA). These pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence
to be expressed.

Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density, the
selected promoter, where it is inducible, is derepressed or induced by appropriate means (e.g., temperature shift or
chemical induction) and cells are cultured for an additional period to provide for expression of the induced gene product.
Thereafter cells are typically harvested, generally by centrifugation, disrupted to release expressed protein, generally
by physical or chemical means, and the resulting crude extract is retained for further purification.

Various mammalian cell culture systems can also be employed to express recombinant protein. Examples of
mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described in Gluzman, Cell 23: 175
(1981), and other cell lines capable of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and
BHK cell lines.

Mammalian expression vectors will comprise an origin of replication, a suitable promoter and enhancer, and also
any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional termination
sequences, and 5' flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for
example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites, may be used to provide the required
nontranscribed genetic elements.

Recombinant polypeptides and proteins produced in bacterial culture are usually isolated by initial extraction from
cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion chromatography steps.
Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw
cycling, sonication, mechanical disruption, or use of cell lysing agents. Protein refolding steps can be used, as
necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can
be employed for final purification steps.

An additional aspect of the invention includes Staphylococcus aureus polypeptides which are useful as
immunodiagnostic antigens and/or immunoprotective vaccines, collectively "immunologically useful polypeptides". Such
immunologically useful polypeptides may be selected from the ORFs disclosed herein based on techniques well known
in the art and described elsewhere herein. The inventors have used the following criteria to select several
immunologically useful polypeptides:

As is known in the art, an amino terminal type I signal sequence directs a nascent protein across the plasma and
outer membranes to the exterior of the bacterial cell. Such outermembrane polypeptides are expected to be
immunologically useful. According to Izard, J. W. et al., Mol. Microbiol. 13, 755-773 (1994), polypeptides containing type I
signal sequences contain the following physical attributes: the length of the type I signal sequence is approximately
15 to 25 primarily hydrophobic amino acid residues with a net positive charge in the extreme amino terminus; the
central region of the signal sequence must adopt an alpha-helical conformation in a hydrophobic environment; and the
region surrounding the actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1
and -3 positions.
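
The attributes listed above can be turned into a rough screening heuristic. The following is only a minimal sketch: the residue classes, the width of the 'extreme amino terminus' (n_region), the 50% hydrophobicity cutoff and the toy sequence are all illustrative assumptions and are not taken from the cited reference. The alpha-helical requirement is not tested at all.

```python
# Residue classes are simplifying assumptions made for this illustration only.
HYDROPHOBIC = set("AVLIMFWC")
SMALL = set("AGS")
BASIC = set("KR")
ACIDIC = set("DE")


def looks_like_type_i_signal(precursor, cleavage_index, n_region=5):
    """Check the listed type I criteria for precursor[:cleavage_index].

    cleavage_index is the 0-based index of the +1 residue (first residue of
    the mature protein). n_region is a hypothetical width for the extreme
    amino terminus.
    """
    signal = precursor[:cleavage_index]
    # Length of approximately 15 to 25 residues.
    if not 15 <= len(signal) <= 25:
        return False
    # Net positive charge in the extreme amino terminus.
    n_term = signal[:n_region]
    net_charge = sum(r in BASIC for r in n_term) - sum(r in ACIDIC for r in n_term)
    if net_charge <= 0:
        return False
    # The remainder should be primarily hydrophobic (assumed cutoff: > 50%).
    rest = signal[n_region:]
    if sum(r in HYDROPHOBIC for r in rest) / len(rest) <= 0.5:
        return False
    # Small side-chain residues at positions -1 and -3.
    return signal[-1] in SMALL and signal[-3] in SMALL


# Toy precursor: a 21-residue candidate signal followed by a mature region.
toy_precursor = "MKKTAIAIAVALAGFATVAQA" + "APKDNTW"
print(looks_like_type_i_signal(toy_precursor, 21))
```

A filter of this kind only narrows a candidate list; any ORF it flags would still need to be examined by the methods described elsewhere herein.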

Also known in the art is the type IV signal sequence which is an example of the several types of functional signal
sequences which exist in addition to the type I signal sequence detailed above. Although functionally related, the type
IV signal sequence possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, S., J.
Bacteriol. 174, 7345-7351 (1992)). These are typically six to eight amino acids with a net basic charge followed by an
additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal sequence is typically
after the initial six to eight amino acids at the extreme amino terminus. In addition, all type IV signal sequences contain
a phenylalanine residue at the +1 site relative to the cleavage site.
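
As an illustration only, the type IV attributes can be checked in a few lines. The hydrophobic residue set, the 60% cutoff and the toy sequence are assumptions made for this sketch, and only the first sixteen residues after the leader are examined (the lower bound of the sixteen-to-thirty range).

```python
# Residue classes and the hydrophobicity cutoff are illustrative assumptions.
HYDROPHOBIC = set("AVLIMFWC")
BASIC = set("KR")
ACIDIC = set("DE")


def find_type_iv_leader(precursor, min_hydrophobic_fraction=0.6):
    """Return the leader length (6 to 8) if the criteria are met, else None."""
    for leader_len in range(6, 9):
        leader = precursor[:leader_len]
        tail = precursor[leader_len:leader_len + 16]
        # Need at least sixteen residues after the leader, starting with Phe (+1).
        if len(tail) < 16 or tail[0] != "F":
            continue
        # The leader must carry a net basic charge.
        net = sum(r in BASIC for r in leader) - sum(r in ACIDIC for r in leader)
        # The following stretch must be primarily hydrophobic.
        fraction = sum(r in HYDROPHOBIC for r in tail) / len(tail)
        if net > 0 and fraction >= min_hydrophobic_fraction:
            return leader_len
    return None


toy_prepilin = "MKAQKGFTLIELMIVVAIIGILAAIAIPQY"  # toy sequence for illustration
print(find_type_iv_leader(toy_prepilin))
```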

Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors have allowed the definition of a consensus
amino acid sequence for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors examined
contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu,
H. C. Lipoproteins in bacteria. J. Bioenerg. Biomembr. 22, 451-471; 1990).
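
The consensus L-(A,S)-(G,A)-C maps directly onto a regular expression. The sketch below is a hedged illustration: the function name, the 40-residue search window and the toy sequences are assumptions, not part of the cited study.

```python
import re

# L at -3, A or S at -2, G or A at -1, C at +1.
LIPOBOX = re.compile(r"L[AS][GA]C")


def find_lipoprotein_cysteine(precursor, search_window=40):
    """Return the 0-based index of the +1 cysteine, or None if no match.

    search_window (hypothetical) limits the scan to the amino terminus,
    where a signal sequence would be expected.
    """
    match = LIPOBOX.search(precursor[:search_window])
    if match is None:
        return None
    return match.start() + 3  # the cysteine is the fourth residue of the motif


toy_lipoprotein = "MKATKLVLGAVILGSTLLAGCSSNAKIDQ"  # toy sequence for illustration
print(find_lipoprotein_cysteine(toy_lipoprotein))
```

Because roughly one quarter of the precursors examined did not carry this exact motif, a miss from such a pattern does not rule out a lipoprotein.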

It is well known that most anchored proteins found on the surface of gram-positive bacteria possess a highly
conserved carboxy terminal sequence. More than fifty such proteins from organisms such as S. pyogenes, S. mutans, E.
faecalis, S. pneumoniae, and others, have been identified based on their extracellular location and carboxy terminal
amino acid sequence (Fischetti, V. A. Gram-positive commensal bacteria deliver antigens to elicit mucosal and systemic
immunity. ASM News 52, 405-410; 1996). The conserved region is comprised of six charged amino acids at the extreme
carboxy terminus coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain.
Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in nearly all proteins ex-
specificity of antigen-antibody interaction at the level of individual amino acids. Proc. Natl. Acad. Sci. USA 82:
5131-5135; this 'Simultaneous Multiple Peptide Synthesis (SMPS)' process is further described in U.S. Patent No.
4,531,211 to Houghten et al. (1985). Epitope-bearing peptides and polypeptides of the invention are used to induce
antibodies according to methods well known in the art. See, for instance, Sutcliffe et al., supra; Wilson et al., supra;
Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Sittle, F. J. et al., J. Gen. Virol. 55:2347-2354 (1955).

Immunogenic epitope-bearing peptides of the invention, i.e., those parts of a protein that elicit an antibody response
when the whole protein is the immunogen, are identified according to methods known in the art. See, for instance,
Geysen et al., supra. Further still, U.S. Patent No. 5,194,392 to Geysen (1990) describes a general method of detecting
or determining the sequence of monomers (amino acids or other compounds) which is a topological equivalent of the
epitope (i.e., a 'mimotope') which is complementary to a particular paratope (antigen binding site) of an antibody of
interest. More generally, U.S. Patent No. 4,433,092 to Geysen (1989) describes a method of detecting or determining
a sequence of monomers which is a topographical equivalent of a ligand which is complementary to the ligand binding
site of a particular receptor of interest. Similarly, U.S. Patent No. 5,430,971 to Houghten, R. A. et al. (1995) on
Peralkylated Oligopeptide Mixtures discloses linear C1-C7-alkyl peralkylated oligopeptides and sets and libraries of such
peptides, as well as methods for using such oligopeptide sets and libraries for determining the sequence of a
peralkylated oligopeptide that preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs of the
epitope-bearing peptides of the invention also can be made routinely by these methods.

Table 4 lists immunologically useful polypeptides identified by an algorithm which locates novel Staphylococcus
aureus outermembrane proteins, as is described above. Also listed are epitopes or 'antigenic regions' of each of the
identified polypeptides. The antigenic regions, or epitopes, are delineated by two numbers x-y, where x is the number
of the first amino acid in the open reading frame included within the epitope and y is the number of the last amino acid
in the open reading frame included within the epitope. For example, the first epitope in ORF 168-6 is comprised of
amino acids 36 to 45 of SEQ ID NO:5,192, as is described in Table 4. The inventors have identified several epitopes
for each of the antigenic polypeptides identified in Table 4. Accordingly, forming part of the present invention are
polypeptides comprising an amino acid sequence of one or more antigenic regions identified in Table 4. The invention
further provides polynucleotides encoding such polypeptides.
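
The x-y notation is 1-based and inclusive at both ends, which differs from the half-open, 0-based slices used by most programming languages. A minimal sketch of the conversion follows; the function name and the toy sequence are hypothetical and stand in for an actual ORF translation.

```python
def epitope_residues(orf_protein, region):
    """Return the residues named by an 'x-y' region (1-based, inclusive)."""
    first, last = (int(part) for part in region.split("-"))
    if not 1 <= first <= last <= len(orf_protein):
        raise ValueError(f"region {region} lies outside the sequence")
    # Convert to a 0-based, half-open Python slice.
    return orf_protein[first - 1:last]


# Toy 60-residue sequence; NOT the sequence of any ORF in Table 4.
toy_orf = "ACDEFGHIKLMNPQRSTVWY" * 3
epitope = epitope_residues(toy_orf, "36-45")
print(epitope, len(epitope))
```

A region such as 36-45 therefore always yields ten residues, since both end points are included.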

The present invention further includes isolated polypeptides, proteins and nucleic acid molecules which are
substantially equivalent to those herein described. As used herein, substantially equivalent can refer both to nucleic acid
and amino acid sequences, for example a mutant sequence, that varies from a reference sequence by one or more
substitutions, deletions, or additions, the net effect of which does not result in an adverse functional dissimilarity
between reference and subject sequences. For purposes of the present invention, sequences having equivalent biological
activity, and equivalent expression characteristics are considered substantially equivalent. For purposes of determining
equivalence, truncation of the mature sequence should be disregarded.

The invention further provides methods of obtaining homologs from other strains of Staphylococcus aureus, of the
fragments of the Staphylococcus aureus genome of the present invention and homologs of the proteins encoded by
the ORFs of the present invention. As used herein, a sequence or protein of Staphylococcus aureus is defined as a
homolog of one of the Staphylococcus aureus fragments or contigs, or of a protein encoded by one of the ORFs
of the present invention, if it shares significant homology to one of the fragments of the Staphylococcus aureus genome
of the present invention or a protein encoded by one of the ORFs of the present invention. Specifically, by using the
sequence disclosed herein as a probe or as primers, and techniques such as PCR cloning and colony/plaque
hybridization, one skilled in the art can obtain homologs.

As used herein, two nucleic acid molecules or proteins are said to 'share significant homology' if the two contain
regions which possess greater than 65% sequence (amino acid or nucleic acid) homology. Preferred homologs in this
regard are those with more than 90% homology. Especially preferred are those with 93% or more homology. Among
especially preferred homologs those with 95% or more homology are particularly preferred. Very particularly preferred
among these are those with 97% and even more particularly preferred among those are homologs with 99% or more
homology. The most preferred homologs among these are those with 99.9% homology or more. It will be understood
that, among measures of homology, identity is particularly preferred in this regard.
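
The tiers above amount to a small lookup on percent identity. The sketch below assumes the two sequences have already been aligned to equal length by some external tool; the function names and the short label strings are illustrative choices, and the 97% tier is read here as '97% or more'.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over two pre-aligned, equal-length sequences.

    Gap characters ('-') never count as matches.
    """
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


# Tiers stated as 'N% or more', highest first.
AT_LEAST_TIERS = [
    (99.9, "most preferred"),
    (99.0, "even more particularly preferred"),
    (97.0, "very particularly preferred"),
    (95.0, "particularly preferred"),
    (93.0, "especially preferred"),
]


def homology_tier(pct):
    """Map a percent homology to the tier named in the text, or None."""
    for threshold, label in AT_LEAST_TIERS:
        if pct >= threshold:
            return label
    if pct > 90.0:  # 'more than 90%'
        return "preferred"
    if pct > 65.0:  # 'greater than 65%'
        return "significant homology"
    return None


print(homology_tier(percent_identity("ACGT", "ACGA")))
```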

Region specific primers or probes derived from the nucleotide sequence provided in SEQ ID NOS:1-5,191 or from
a nucleotide sequence at least 95%, particularly at least 99%, especially at least 99.5% identical to a sequence of SEQ
ID NOS:1-5,191 can be used to prime DNA synthesis and PCR amplification, as well as to identify colonies containing
cloned DNA encoding a homolog. Methods suitable to this aspect of the present invention are well known and have
been described in great detail in many publications such as, for example, Innis et al., PCR PROTOCOLS, Academic
Press, San Diego, CA (1990).

When using primers derived from SEQ ID NOS:1-5,191 or from a nucleotide sequence having an aforementioned
identity to a sequence of SEQ ID NOS:1-5,191, one skilled in the art will recognize that by employing high stringency
conditions (e.g., annealing at SO-SO°C in 5X SSPC and 50% formamide, and washing at 50-55°C in 0.5X SSPC) only
sequences which are greater than 75% homologous to the primer will be amplified. By employing lower stringency
quantitative determination of glucose in body fluids recently in biotechnology for analyzing syrups from starch and
cellulose hydrolysates. This application is described in Owusu et al., Biochem. et Biophysica Acta £72: S3 (l 93S), for
instance.

The main sweetener used in the world today is sugar which comes from sugar beets and sugar cane. In the field
of industrial enzymes, the glucose isomerase process shows the largest expansion in the market today. Initially, soluble
enzymes were used and later immobilized enzymes were developed (Krueger et al., Biotechnology, The Textbook of
Industrial Microbiology, Sinauer Associated Incorporated, Sunderland, Massachusetts (1990)). Today, the use of
glucose-produced high fructose syrups is by far the largest industrial business using immobilized enzymes. A review of
the industrial use of these enzymes is provided by Jorgensen, Starch 40:307 (1988).

Proteinases, such as alkaline serine proteinases, are used as detergent additives and thus represent one of the
largest volumes of microbial enzymes used in the industrial sector. Because of their industrial importance, there is a
large body of published and unpublished information regarding the use of these enzymes in industrial processes. (See
Faultman et al., Acid Proteases Structure Function and Biology, Tang, J., ed., Plenum Press, New York (1977) and
Godfrey et al., Industrial Enzymes, MacMillan Publishers, Surrey, UK (1983) and Hepner et al., Report Industrial
Enzymes by 1990, Hel Hepner & Associates, London (1935)).

Another class of commercially usable proteins of the present invention are the microbial lipases, described by, for
instance, Macrae et al., Philosophical Transactions of the Chiral Society of London 310:227 (1935) and Poserke,
Journal of the American Oil Chemist Society 61:1753 (1984). A major use of lipases is in the fat and oil industry for the
production of neutral glycerides using lipase catalyzed inter-esterification of readily available triglycerides. Applications
of lipases include use as a detergent additive to facilitate the removal of fats from fabrics in the course of the
washing procedures.

The use of enzymes, and in particular microbial enzymes, as catalysts for key steps in the synthesis of complex
organic molecules is gaining popularity at a great rate. One area of great interest is the preparation of chiral
intermediates. Preparation of chiral intermediates is of interest to a wide range of synthetic chemists, particularly those
scientists involved with the preparation of new pharmaceuticals, agrochemicals, fragrances and flavors. (See Davies et al.,
Recent Advances in the Generation of Chiral Intermediates Using Enzymes, CRC Press, Boca Raton, Florida (1990)).
The following reactions catalyzed by enzymes are of interest to organic chemists: hydrolysis of carboxylic acid esters,
phosphate esters, amides and nitriles, esterification reactions, trans-esterification reactions, synthesis of amides,
reduction of alkanones and oxoalkanates, oxidation of alcohols to carbonyl compounds, oxidation of sulfides to sulfoxides,
and carbon bond forming reactions such as the aldol reaction.

When considering the use of an enzyme encoded by one of the ORFs of the present invention for biotransformation
and organic synthesis it is sometimes necessary to consider the respective advantages and disadvantages of using a
microorganism as opposed to an isolated enzyme. The pros and cons of using a whole cell system on the one hand or an
isolated partially purified enzyme on the other hand have been described in detail by Bud et al., Chemistry in Britain
(1987), p. 127.

Amino transferases, enzymes involved in the biosynthesis and metabolism of amino acids, are useful in the catalytic
production of amino acids. The advantage of using microbial based enzyme systems is that the amino transferase
enzymes catalyze the stereo-selective synthesis of only L-amino acids and generally possess uniformly high catalytic
rates. A description of the use of amino transferases for amino acid production is provided by Roselle-David, Methods
of Enzymology 36 A79 (1987).

Another category of useful proteins encoded by the ORFs of the present invention includes enzymes involved in
nucleic acid synthesis, repair, and recombination. A variety of commercially important enzymes have previously been
isolated from members of Staphylococcus aureus. These include Sau3A and Sau96I.

2. Generation of Antibodies

As described here, the proteins of the present invention, as well as homologs thereof, can be used in a variety
of procedures and methods known in the art which are currently applied to other proteins. The proteins of the present
invention can further be used to generate an antibody which selectively binds the protein. Such antibodies can be
either monoclonal or polyclonal antibodies, as well as fragments of these antibodies, and humanized forms.

The invention further provides antibodies which selectively bind to one of the proteins of the present invention and 
hybridomas which produce these antibodies. A hybridoma is an immortalized cell line which is capable of secreting a 
specific monoclonal antibody. 

In general, techniques for preparing polyclonal and monoclonal antibodies as well as hybridomas capable of
producing the desired antibody are well known in the art (Campbell, A. M., MONOCLONAL ANTIBODY TECHNOLOGY:
LABORATORY TECHNIQUES IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, Elsevier Science Publishers,
Amsterdam, The Netherlands (1984); St. Groth et al., J. Immunol. Methods 25:1-21 (1960); Kohler and Milstein, Nature
255:495-497 (1975)), the trioma technique, the human B-cell hybridoma technique (Kozbor et al., Immunology Today
on the assay format, nature of the detection method and the tissues, cells or extracts used as the sample to be assayed.
Methods for preparing protein extracts or membrane extracts of cells are well known in the art and can readily be
adapted in order to obtain a sample which is compatible with the system utilized.

In another embodiment of the present invention, kits are provided which contain the necessary reagents to carry
out the assays of the present invention.

Specifically, the invention provides a compartmentalized kit to receive, in close confinement, one or more containers
which comprises: (a) a first container comprising one of the DFs, antigens or antibodies of the present invention; and
(b) one or more other containers comprising one or more of the following: wash reagents, reagents capable of detecting
presence of a bound DF, antigen or antibody.

In detail, a compartmentalized kit includes any kit in which reagents are contained in separate containers. Such
containers include small glass containers, plastic containers or strips of plastic or paper. Such containers allow one
to efficiently transfer reagents from one compartment to another compartment such that the samples and reagents are
not cross-contaminated, and the agents or solutions of each container can be added in a quantitative fashion from one
compartment to another. Such containers will include a container which will accept the test sample, a container which
contains the antibodies used in the assay, containers which contain wash reagents (such as phosphate buffered saline,
Tris-buffers, etc.), and containers which contain the reagents used to detect the bound antibody, antigen or DF.

Types of detection reagents include labelled nucleic acid probes, labelled secondary antibodies, or in the
alternative, if the primary antibody is labelled, the enzymatic, or antibody binding reagents which are capable of reacting with
the labelled antibody. One skilled in the art will readily recognize that the disclosed DFs, antigens and antibodies of the
present invention can be readily incorporated into one of the established kit formats which are well known in the art.

4. Screening Assay for Binding Agents 

Using the isolated proteins of the present invention, the present invention further provides methods of obtaining
and identifying agents which bind to a protein encoded by one of the ORFs of the present invention or to one of the
fragments and the Staphylococcus aureus fragment and contigs herein described.
In general, such methods comprise steps of: 

(a) contacting an agent with an isolated protein encoded by one of the ORFs of the present invention, or an isolated
fragment of the Staphylococcus aureus genome; and

(b) determining whether the agent binds to said protein or said fragment. 

The agents screened in the above assay can be, but are not limited to, peptides, carbohydrates, vitamin derivatives,
or other pharmaceutical agents. The agents can be selected and screened at random or rationally selected or designed
using protein modeling techniques.

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and the like are selected
at random and are assayed for their ability to bind to the protein encoded by the ORF of the present invention.

Alternatively, agents may be rationally selected or designed. As used herein, an agent is said to be 'rationally
selected or designed' when the agent is chosen based on the configuration of the particular protein. For example, one
skilled in the art can readily adapt currently available procedures to generate peptides, pharmaceutical agents and the
like capable of binding to a specific peptide sequence in order to generate rationally designed antipeptide peptides,
for example see Hurby et al., 'Application of Synthetic Peptides: Antisense Peptides,' In Synthetic Peptides, A User's
Guide, W. H. Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical
agents, or the like.

In addition to the foregoing, one class of agents of the present invention, as broadly described, can be used to
control gene expression through binding to one of the ORFs or EMFs of the present invention. As described above,
such agents can be randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a skilled
artisan to design sequence specific or element specific agents, modulating the expression of either a single ORF or
multiple ORFs which rely on the same EMF for expression control.

One class of DNA binding agents are agents which contain base residues which hybridize or form a triple helix by
binding to DNA or RNA. Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or can
be a variety of sulfhydryl or polymeric derivatives which have base attachment capacity.

Agents suitable for use in these methods usually contain 20 to 40 bases and are designed to be complementary
to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney
et al., Science 241:455 (1988); and Dervan et al., Science 251:1350 (1991)) or to the mRNA itself (antisense - Okano,
J. Neurochem. 55:550 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription from DNA, while antisense
RNA hybridization blocks translation of an mRNA molecule into polypeptide. Both techniques have been demonstrated
be measured at the same time. The composition of the present invention can be administered concurrently with, prior
to, or following the administration of the other agent.

The agents of the present invention are intended to be provided to recipient subjects in an amount sufficient to 
decrease the rate of growth (as defined above) of the target organism. 

The administration of the agent(s) of the invention may be for either a 'prophylactic' or 'therapeutic' purpose.
When provided prophylactically, the agent(s) are provided in advance of any symptoms indicative of the organism's
growth. The prophylactic administration of the agent(s) serves to prevent, attenuate, or decrease the rate of onset of
any subsequent infection. When provided therapeutically, the agent(s) are provided at (or shortly after) the onset of an
indication of infection. The therapeutic administration of the compound(s) serves to attenuate the pathological
symptoms of the infection and to increase the rate of recovery.

The agents of the present invention are administered to a subject, such as a mammal, or a patient, in a pharma- 
ceutically acceptable form and in a therapeutically effective concentration. A composition is said to be 'pharmacolog- 
ically acceptable' if its administration can be tolerated by a recipient patient. Such an agent is said to be administered 
in a 'therapeutically effective amount' if the amount administered is physiologically significant. An agent is physiolog- 
ically significant if its presence results in a detectable change in the physiology of a recipient patient. 

The agents of the present invention can be formulated according to known methods to prepare pharmaceutically
useful compositions, whereby these materials, or their functional derivatives, are combined in admixture with a
pharmaceutically acceptable carrier vehicle. Suitable vehicles and their formulation, inclusive of other human proteins,
e.g., human serum albumin, are described, for example, in REMINGTON'S PHARMACEUTICAL SCIENCES, 16th Ed.,
Osol, A., Ed., Mack Publishing, Easton PA (1980). In order to form a pharmaceutically acceptable composition suitable
for effective administration, such compositions will contain an effective amount of one or more of the agents of the
present invention, together with a suitable amount of carrier vehicle.

Additional pharmaceutical methods may be employed to control the duration of action. Controlled release preparations
may be achieved through the use of polymers to complex or absorb one or more of the agents of the present invention.
The controlled delivery may be effectuated by a variety of well known techniques, including formulation with
macromolecules such as, for example, polyesters, polyamino acids, polyvinyl pyrrolidone, ethylenevinylacetate,
methylcellulose, carboxymethylcellulose, or protamine sulfate, adjusting the concentration of the macromolecules and the agent
in the formulation, and by appropriate use of methods of incorporation, which can be manipulated to effectuate a desired
time course of release. Another possible method to control the duration of action by controlled release preparations is
to incorporate agents of the present invention into particles of a polymeric material such as polyesters, polyamino
acids, hydrogels, poly(lactic acid) or ethylene vinylacetate copolymers. Alternatively, instead of incorporating these
agents into polymeric particles, it is possible to entrap these materials in microcapsules prepared, for example, by
coacervation techniques or by interfacial polymerization with, for example, hydroxymethylcellulose or
gelatine-microcapsules and poly(methylmethacrylate) microcapsules, respectively, or in colloidal drug delivery systems, for example,
liposomes, albumin microspheres, microemulsions, nanoparticles, and nanocapsules or in macroemulsions. Such
techniques are disclosed in REMINGTON'S PHARMACEUTICAL SCIENCES (1980).

The invention further provides a pharmaceutical pack or kit comprising one or more containers filled with one or
more of the ingredients of the pharmaceutical compositions of the invention. Associated with such container(s) can be
a notice in the form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals
or biological products, which notice reflects approval by the agency of manufacture, use or sale for human
administration.

In addition, the agents of the present invention may be employed in conjunction with other therapeutic compounds.

6. Shot-Gun Approach to Megabase DNA Sequencing

The present invention further demonstrates that a large sequence can be sequenced using a random shotgun 
approach. This procedure, described in detail in the examples that follow, has eliminated the up front cost of isolating 
and ordering overlapping or contiguous subclones prior to the start of the sequencing protocols. 

Certain aspects of the present invention are described in greater detail in the examples that follow. The examples 
are provided by way of illustration. Other aspects and embodiments of the present invention are contemplated by the 
inventors, as will be clear to those of skill in the art from reading the present disclosure. 
mercaptoethanol was added to the aliquot of cells to a final concentration of 25 mM. Cells were incubated on ice for
10 min. A 1 ul aliquot of the final ligation was added to the cells and incubated on ice for 30 min. The cells were heat
pulsed for 30 sec. at 42°C and placed back on ice for 2 min. The outgrowth period in liquid culture was eliminated
from this protocol in order to minimize the preferential growth of any given transformed cell. Instead the transformation
mixture was plated directly on a nutrient rich SOB plate containing a 5 ml bottom layer of SOB agar (5% SOB agar:
20 g tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco Agar per liter of media). The 5 ml bottom layer is supplemented
with 0.4 ml of 50 mg/ml ampicillin per 100 ml SOB agar. The 15 ml top layer of SOB agar is supplemented with 1 ml
X-Gal (2%), 1 ml MgCl2 (1 M), and 1 ml MgSO4/100 ml SOB agar. The 15 ml top layer was poured just prior to plating.
Our titer was approximately 100 colonies/10 ul aliquot of transformation.
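
The supplement and titer figures quoted above reduce to simple arithmetic. The sketch below works them through; the helper names are hypothetical, and the dilution calculation assumes the small volume contributed by the stock solution itself can be neglected.

```python
def final_concentration_ug_per_ml(stock_mg_per_ml, stock_volume_ml, medium_volume_ml):
    """Final concentration in ug/ml, neglecting the volume added by the stock."""
    mg_per_ml = stock_mg_per_ml * stock_volume_ml / medium_volume_ml
    return mg_per_ml * 1000.0  # convert mg/ml to ug/ml


def colonies_per_ul(colonies, aliquot_ul):
    """Transformation titer expressed per microliter plated."""
    return colonies / aliquot_ul


# 0.4 ml of 50 mg/ml ampicillin per 100 ml of SOB agar (bottom layer).
ampicillin = final_concentration_ug_per_ml(50.0, 0.4, 100.0)
# Approximately 100 colonies per 10 ul aliquot of transformation.
titer = colonies_per_ul(100, 10)
print(f"ampicillin: about {ampicillin:.0f} ug/ml; titer: about {titer:.0f} colonies/ul")
```

Under these assumptions the bottom layer carries roughly 200 ug/ml ampicillin and the transformation yields roughly 10 colonies per microliter plated.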

All colonies were picked for template preparation regardless of size. Thus, only clones lost due to 'poison' DNA
or deleterious gene products would be deleted from the library, resulting in a slight increase in gap number over that
expected.

3. Random DNA Sequencing 

High quality double stranded DNA plasmid templates were prepared using an alkaline lysis method developed in
collaboration with 5Prime → 3Prime Inc. (Boulder, CO). Plasmid preparation was performed in a 96-well format for all
stages of DNA preparation from bacterial growth through final DNA purification. Average template concentration was
determined by running 25% of the samples on an agarose gel. DNA concentrations were not adjusted.

Templates were also prepared from a Staphylococcus aureus lambda genomic library. An unamplified library was
constructed in Lambda DASH II vector (Stratagene). Staphylococcus aureus DNA (> 100 kb) was partially digested in
a reaction mixture (200 ul) containing .50 ug DNA, 1X Sau3AI buffer, 20 units Sau3AI for 6 min. at 23 C. The digested
DNA was phenol-extracted and centrifuged over a 10-40% sucrose gradient. Fractions containing genomic DNA of
15-25 kb were recovered by precipitation. One ul of fragments was used with 1 ul of DASH II vector (Stratagene) in
the recommended ligation reaction. One ul of the ligation mixture was used per packaging reaction following the
recommended protocol with the Gigapack II XL Packaging Extract. Phage were plated directly without amplification from
the packaging mixture (after dilution with 500 ul of recommended SM buffer and chloroform treatment). Yield was about
2.5x109 pfu/ul.

An amplified library was prepared from the primary packaging mixture according to the manufacturer's protocol.
The amplified library is stored frozen in 7% dimethylsulfoxide. The phage titer is approximately 1x10^ pfu/ml.

Mini-liquid lysates (0.1 ul) are prepared from randomly selected plaques and template is prepared by long range 
PCR. Samples are PCR amplified using modified T3 and T7 primers, and Elongase Supermix (LTI). 

Sequencing reactions are carried out on plasmid templates using a combination of two workstations (BIOMEK
1000 and Hamilton Microlab 2200) and the Perkin-Elmer 9600 thermocycler with Applied Biosystems PRISM Ready
Reaction Dye Primer Cycle Sequencing Kits for the M13 forward (M13-21) and the M13 reverse (M13RP1) primers.
Dye terminator sequencing reactions are carried out on the lambda templates on a Perkin-Elmer 9600 Thermocycler
using the Applied Biosystems Ready Reaction Dye Terminator Cycle Sequencing kits. Modified T7 and T3 primers are
used to sequence the ends of the inserts from the Lambda DASH II library. Sequencing reactions are run on a combination
of ABI 373 DNA Sequencers and ABI 377 DNA sequencers. All of the dye terminator sequencing reactions are analyzed
using the 2X 9 hour module on the ABI 377. Dye primer reactions are analyzed on a combination of ABI 373 and ABI
377 DNA sequencers. The overall sequencing success rate is approximately 85% for M13-21 and M13RP1
sequences and 65% for dye-terminator reactions. The average usable read length is 485 bp for M13-21 sequences,
445 bp for M13RP1 sequences, and 375 bp for dye-terminator reactions.
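
One way to compare the three chemistries is to combine the quoted success rate and average usable read length into an expected number of usable bases per attempted reaction. This is only a rough illustrative figure, not a metric used in the protocol itself; the dictionary and function names are hypothetical.

```python
# (success rate, average usable read length in bp), as quoted in the text.
READ_STATS = {
    "M13-21 dye primer": (0.85, 485),
    "M13RP1 dye primer": (0.85, 445),
    "dye terminator": (0.65, 375),
}


def expected_bases_per_attempt(success_rate, read_length_bp):
    """Expected usable bases from one attempted sequencing reaction."""
    return success_rate * read_length_bp


expected = {
    name: expected_bases_per_attempt(rate, length)
    for name, (rate, length) in READ_STATS.items()
}
for name, bases in expected.items():
    print(f"{name}: about {bases:.0f} usable bp per attempted reaction")
```

On these figures a dye-primer reaction returns noticeably more usable sequence per attempt than a dye-terminator reaction, because both its success rate and its read length are higher.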

4. Protocol for Automated Cycle Sequencing

The sequencing was carried out using Hamilton Microstation 2200, Perkin Elmer 9600 thermocyclers, ABI 373
and ABI 377 Automated DNA Sequencers. The Hamilton combines pre-aliquoted templates and reaction mixes
consisting of deoxy- and dideoxynucleotides, the thermostable Taq DNA polymerase, fluorescently-labelled sequencing
primers, and reaction buffer. Reaction mixes and templates were combined in the wells of a 96-well thermocycling
plate and transferred to the Perkin Elmer 9600 thermocycler. Thirty consecutive cycles of linear amplification (i.e., one
primer synthesis) steps were performed including denaturation, annealing of primer and template, and extension, i.e.,
DNA synthesis. A heated lid with rubber gaskets on the thermocycling plate prevents evaporation without the need for
an oil overlay.
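
The term 'linear amplification' can be made concrete with a small comparison. With a single primer, each cycle can copy only the original template strand, so product accumulates in proportion to the number of cycles; with two opposing primers, as in conventional PCR, product can double each cycle. The sketch below assumes idealized per-cycle efficiency and is not part of the protocol.

```python
def linear_copies(cycles, efficiency=1.0):
    """Extension products per template with one primer (cycle sequencing)."""
    return cycles * efficiency


def exponential_copies(cycles, efficiency=1.0):
    """Fold amplification with two opposing primers (conventional PCR)."""
    return (1.0 + efficiency) ** cycles


cycles = 30  # thirty consecutive cycles, as in the protocol
print(f"one primer:  at most {linear_copies(cycles):.0f} products per template")
print(f"two primers: up to {exponential_copies(cycles):.3g}-fold amplification")
```

This is why the amount of template supplied matters for cycle sequencing: thirty cycles give at most about thirty extension products per template molecule under these idealized assumptions.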

Two sequencing protocols were used: one for dye-labelled primers and a second for dye-labelled dideoxy chain
terminators. The shotgun sequencing involves use of four dye-labelled sequencing primers, one for each of the four
terminator nucleotides. Each dye-primer was labelled with a different fluorescent dye, permitting the four individual
reactions to be combined into one lane of the 373 or 377 DNA Sequencer for electrophoresis, detection, and base-
ORFs of at least 120 amino acids that did not match protein or nucleotide sequences in the databases at these levels
are shown in Table 3. 
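
The 120 amino acid length cutoff mentioned above can be illustrated with a minimal ORF scan. This is a sketch under stated assumptions, not the gene-finding method used to produce the tables: it searches only the three forward reading frames, treats ATG as the sole start codon (bacterial genes may also start with GTG or TTG), and the function name and default threshold parameter are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, min_aa=120):
    """Return (start, end, frame) for forward-strand ATG-to-stop ORFs.

    An ORF is reported only if it encodes at least min_aa amino acids,
    counting the initiating Met and excluding the stop codon. Coordinates
    are 0-based, and end is exclusive and includes the stop codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                n_aa = (i - start) // 3
                if n_aa >= min_aa:
                    orfs.append((start, i + 3, frame))
                start = None  # resume the search after this stop codon
    return orfs


if __name__ == "__main__":
    example = "ATG" + "GCT" * 119 + "TAA"  # encodes exactly 120 residues
    print(find_orfs(example))
```

A complete analysis would also scan the reverse complement and compare each predicted ORF against protein and nucleotide databases, as the text describes.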

ILLUSTRATIVE APPLICATIONS 

1. Production of an Antibody to a Staphylococcus aureus Protein 

Substantially pure protein or polypeptide is isolated from the transfected or transformed cells using any one of the
methods known in the art. The protein can also be produced in a recombinant prokaryotic expression system, such as
E. coli, or can be chemically synthesized. Concentration of protein in the final preparation is adjusted, for example, by
concentration on an Amicon filter device, to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the
protein can then be prepared as follows.

2. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as described can be prepared from
murine hybridomas according to the classical method of Kohler, G. and Milstein, C., Nature 256:495 (1975) or
modifications of the methods thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected protein
over a period of a few weeks. The mouse is then sacrificed, and the antibody-producing cells of the spleen are isolated.
The spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, and the excess unfused cells
are destroyed by growth of the system on selective media comprising aminopterin (HAT media). The successfully fused
cells are diluted and aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is continued.
Antibody-producing clones are identified by detection of antibody in the supernatant fluid of the wells by immunoassay
procedures, such as ELISA, as originally described by Engvall, E., Meth. Enzymol. 70:419 (1980), and modified
methods thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested for use.
Detailed procedures for monoclonal antibody production are described in Davis, L. et al., Basic Methods in Molecular
Biology, Elsevier, New York, Section 21-2 (1989).

3. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogenous epitopes of a single protein can be prepared by
immunizing suitable animals with the expressed protein described above, which can be unmodified or modified to enhance
immunogenicity. Effective polyclonal antibody production is affected by many factors related both to the antigen and
the host species. For example, small molecules tend to be less immunogenic than others and may require the use of
carriers and adjuvant. Also, host animals vary in response to site of inoculations and dose, with both inadequate and
excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of antigen administered at multiple
intradermal sites appear to be most reliable. An effective immunization protocol for rabbits can be found in Vaitukaitis,
J. et al., J. Clin. Endocrinol. Metab. 33:988-991 (1971).

Booster injections can be given at regular intervals, and antiserum harvested when antibody titer thereof, as
determined semi-quantitatively, for example, by double immunodiffusion in agar against known concentrations of the
antigen, begins to fall. See, for example, Ouchterlony, O. et al., Chap. 19 in: Handbook of Experimental Immunology,
Wier, D., ed., Blackwell (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum
(about 12 μM). Affinity of the antisera for the antigen is determined by preparing competitive binding curves, as described,
for example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, second edition, Rose and Friedman, eds., Amer.
Soc. For Microbiology, Washington, D.C. (1980).

Antibody preparations prepared according to either protocol are useful in quantitative immunoassays which
determine concentrations of antigen-bearing substances in biological samples; they are also used semi-quantitatively
or qualitatively to identify the presence of antigen in a biological sample. In addition, they are useful in various animal
models of Staphylococcal disease known to those of skill in the art as a means of evaluating the protein used to make
the antibody as a potential vaccine target or as a means of evaluating the antibody as a potential immunotherapeutic
reagent.

3. Preparation of PCR Primers and Amplification of DNA

Various fragments of the Staphylococcus aureus genome, such as those of Tables 1-3 and SEQ ID NOS:1-5,191,
can be used, in accordance with the present invention, to prepare PCR primers for a variety of uses. The PCR primers
are preferably at least 15 bases, and more preferably at least 18 bases in length. When selecting a primer sequence,
it is preferred that the primer pairs have approximately the same G/C ratio, so that melting temperatures are approxi-
GCTAATGGGA AATATGGAAT AAATGTGATT TGGTGATCAA CACAATATTG TAATACTGCC 2160

TCATTTTCGC GATGCAATAA ATTATATTCT AACTGTACAA CATCAACGTA ACCATCTTTA 2220

TTTGCTTCTT TAf^GTTGATC TAATGTGAAA TTTGATACAC CAATTGCTTT AATCTTCCCT 2280

TGTTCCTTAA GCTCTTGTAA TGCTGCAACT GCTTGATCTT TCGGAGTGTT GTTATCCGGA 2340

AAATGAATAT AATATAAATC GATATAATCA GTTTGTAGAC GTTTCAAACT ATTCTCAACT 2400

TGTTGTTTTA AATATTCCGG TTGATTGTTC TGATGTACTT CTTGATTTTC ATCAAATTCA 2460

TGAGACCCTT TCGTAGCAAT TTTAATTTGC TCTCGCGGAT ATTCTTTAAC AACTTCTCCA 2520

ACCAATTCTT CTGATCGTTC TGGCCCATAA ATATATGCCG TATCTAATAA ATTAATACCA 2580

TGATTAATGG CTTGACGAAC AACATCTTTT CCTTGTTCTT CATCTAAGTT CGGATATAAA 2640

TTATGCCCAa CCTAtGCGTT CGTCCCAAGT GCGATTGGAA ACACTTCAAC ATCAGATTTA 2700

CCTAAGTTTA CAAATTGCTn CATTAGACCC AGCnCCTT 2738

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9425 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

GATTAGATGA TATTTAACGA AAATTAaGrT GmAATACTtG AATGTArGAa GTCTGATGTC 60

GAAAATAGCT ATTAAAATAG AGTAGACGTA ATGCAAATGA AAGCACCTAA AATAGAAAAA 120

TTTCAAAAAT AGCGTAATTA TTATAATAAA TAGACTGCCA ATAAAATGCA ATTTTTCACT 180

TATAACATTC TTCAAAAAAT AATAGCAAAA TTATGTAAAA AATATCTTGT CATGGCAAGA 240

TTGGCTGTGC TATAATCTAT CTTGTGCTTA AGAACGGCTC CTTGGTCAAG CGGTTAAGAC 300

ACCGCCCTTT CACC^CGCTA ACACGGGTTC GAGTCCCGTA GGAGTCACCA TTTTTTAGGT 360

CTCGTAGTGT AGCGGTTAAC ACGCCTGCCT GTCACGCAGG AGATCGCGGG TTCGATTCCC 420

GTCGAGACCG TACAAATGCC TATCCAAGAG GATAGGCATT TTTTTGCGTT TAATATTATA 480

TTAATAAAAG ATATATGGAC GAATGATAAT CATATTGATT TATCTGTTCG TCCATTTTCT 540

TTAAAATGTA TGAACCTCAA GTAACTTAGT GGTTGGATAT GAAAGATAAA CGTAGACAAT 600

AAAATCTTTA TTAGACGTAC AAACATATGC TACTGTCAAC ATATTTCTTC GTTGTGATAT 660

GCCACCAGTC CTCCATAACA TCAATTGTTA AAGTAACGAA TAACGAATAA TGATATTTAT 720
GTGATGGTAT TATTGCAACA GACCGCCGTG GACGTATTCG TATCGTCAAT GATATGGCAC 2640

TCAAGATGCT TGGTATGGCG AAAGAAGACA TCATCGGATA TTACATGTTA AGTGTATTAA 2700

GTCTTGAAGA TGAATTTAAA CTGGAAGAAA TTCAAGAGAA TAATGATAGT TTCTTATTAG 2760

ATTTAAATGA AGAAGAAGGT CTAATCGCi^C GTGTTAACTT TAGTACGATT GTGCAGGAAA 2820

CAGGATTTGT AACTGGTTAT ATCGCTGTGT TACATGACGT AACTGAACAA CAACAAGTTG 2880

AACGTGAGCG TCGTGAATTT GTTGCCAATG TATCACATGA GTTACGTACA CCTTTAACTT 2940

CTATGAATAG TTACATTGAA GCACTTGAAG AAGGTGCATG GAAAGATGAG GAACTTGCGC 3000

CACAATTTTT ATGTGTTACC CGTGAAGAAA CAGAACGAAT GATTCGACTG GTCAATGACT 3060

TGCTACAGTT ATCTAAAATG GATAATGAGT CTGATCAAAT CAACAAAGAA ATTATCGACT 3120

TTAACATGTT CATTAATAAA ATTATTAATC GACATGAAAT GTCTGCGAAA GATACAACAT 3180

TTATTCGAGA TATTCCGAAA AAGACGATTT TCACAGAATT TGATCCTGAT AAAATGACGC 3240

AAGTATTTGA TAATGTCATT ACAAATGCGA TGAAATATTC TAGAGGCGAT AAACGTGTCG 3300

AGTTCCACGT GAAACAAAAT CCACTTTATA ATCGAATGAC GATTCGTATT AAAGATAATG 3360

GCATTGGTAT TCCTATCAAT AAAGTCGATA AGATATTCGA CCGATTCTAT CGTGTAGATA 3420

AGGCACGTAC GCGTAAAATG GGTGGTACTG GATTAGGACT AGCCATTTCG AAAGAGATTG 3480

TGGAAGCGCA CAATGGTCGT ATTTGGGCAA ACAGTGTAGA AGGTCAAGGT ACATCTATCT 3540

TTATCACACT TCCATGTGAA GTCATTGAAG ACGGTGATTG GGATGAATAA TAAGGAGCAT 3600

ATTAAATCTG TCATTTTAGC ACTACTCGTC TTGATGAGTG TCGTATTGAC ATATATGGTA 3660

TGGAACTTTT CTCCTGATAT TGCAAATGTC GACAATACAG ATAGTAAGAA GAGTGAAACG 3720

rAACCTTTAA CGACACCTAT GACAGCCAAA ATGGATACAA CTATTACGCC ATTTCAGATT 3780

ATTCATTCGA AAAATGATCA TCCAGAAGGA ACGATTGCGA CGGTATCTAA TGTGAATAAA 3840

CTGACGAAAC CTTTGAAAAA TAAAGAAGTG AAGTCCGTGG AACATGTTCG TCGTGATCAT 3900

AACTTGATGA TTCCTGATTT GAACAGTGAT TTTATATTAT TCGATTTTAC GTATGATTTA 3960

CCGTTATCAA CATATCTTGG TCAAGTACTG AACATGAATG CGAAAGTACC AAATCATTTC 4020

AATTTCAATC G'lTI'GGTGAT AGATCATGAT GCTGATGATA ATATCGTGCT TTATGCTATA 4080

AGCAAAGATC GCCACGATTA CGTAAAATTA ACAACTACAA CGAAAAATGA TCATTTTTTA 4140

GATGCATTAG CAGCAGTGAA AAAAGATATG CAACCATACA CAGATATCAT CACAAACAAA 4200

GATACAATTG ATCGTACGAC GCATGTTTTT GCACCAAGTA AACCTGAAAA GTTAAAAACA 4260

TATCGCATGG TATTTAACAC GATTAGTGTT GAGAAAATGA ATGCTATACT ATTTGACGAT 4320
TAGTTGATGT TGGTTTGACT GGAAAGAAAA TGGAAGAATT GTTTAGi'CAA ATTGACCGTA 6240

ATATTCAAGA TTTAAATGGT ATTrTAGTAA CCCATGAACA TATTGATCAT ATTAAAGGAT 6300

TAGGTGTTTT GGCGCGTAAA TATCAATTGC CAATTTATGC GAATGAAAAA ACTTGGCAGG 6360

CAATTGAAAA GAAAGATAGT CGCATCCCTA TGGATCAGAA ATTCATTTTT AATCCTTATG 6420

AAACAAAATC TATTGCAGGT TTCGATGTTG AATCGTTTAA CGTGTCACAT GATGCAATAG 6480

ATCCGCAATT TTATATTTTC CATAATAACT ATAAGAAGTT TACGATTTTA ACGGATACGG 6540

GTTACGTGTC TGATCGTATG AAAGGTATGA TACGTGGCAG CGATGCGTn ATTTTTGAGA 6600

GTAATCATGA CGTCGATATG TTGAGAATGT GTCGTTATCC ATGGAAGACG AAACAACGTA 6660

TTTTAGGCGA TATGGGTCAT GTATCTAATG AGGATGCGGC TCATGCAATG ACAGACGTGA 6720

TTACAGGTAA CACGAAACGT ATTTACCTAT CGCATTTATC ACAAGACAAT AACATGAAAG 6780

ATTTGGCGCG TATGAGTGTT GGCCAAGTAT TGAACGAACA CGATATTGAT ACGGAAAAAG 6840

AAGTATTGCT ATGTGATACG GATAAAGCTA TTCCAACGCC AATATATACA ATATAAATGA 6900

GAGTCATCCG ATAAAGTTCC GCATTGCTGT GAGACGACTT TATCGGGTGC TTTTTTATGT 6960

TGTTGGTGGG AAATGGCTGT TGTTGAGTTG AATCGGCTTG ATTGAAATGT GTAAAATAAT 7020

TCGATATTAA ATGTAATTTA TAAATAATTT ACATAAAATC AATCATTTTA ATATAAGGAT 7080

TATGATAATA TATTGGTGTA TGACAGTTAA TGGAGGGAAC GAAATGAAAG CTTTATTACT 7140

TAAAACAAGT GTATGGCTCG TTTTGCTTTT TAGTGTAATG GGATTATGGC AAGTCTCGAA 7200

CGCGGCTGAG CAGCATACAC CAATGAAAGC ACATGCAGTA ACAACGATAG ACAAAGCAAC 7260

AACAGATAAG CAACAAGTAC CGCCAACAAA GGAAGCGGCT CATCATTCTG GCAAAGAAGC 7320

GGCAACCAAC GTATCAGCAT CAGCGCAGGG AACAGCTGAT GATACAAACA GCAAAGTAAC 7380

ATCC^CXJCA CCATCTAACA AACCATCTAC AGTAGTTTCA ACAAAAGTAA ACGAAACACG 7440

CGACGTAGAT ACACAACAAG CCTCAACACA AAAACCAACT CACACAGCAA CGTTCAAATT 7500

ATCAAATGCr AAAACAGCAT CACTTTGACC ACGAATGTTT GCTGCTAATG CACCACAAAC 7560

AACAACACAT AAAATATTAC ATACAAATGA TATCCATGGC CGACTAGCCG AAGAAAAAGG 7620

GCGTGTCATC GGTATGGCTA AATTAAAAAC AGTAAAAGAA CAAGAAAAGC CTGATTTAAT 7680

GTTAGACGCA GGAGACGCCT TCCAAGGTTT ACCACTTTCi^ AACCAGTCTA AAGGTGAAGA 7740

AATGGCTAAA GCAATGAATG CAGTAGGTTA TGATGCTATG GCAGTCGGTA ACCATGAATT 7800

TGACTTTGGA TACGATCAGT TGAAAAAGTT AGAGGGTATG TTAGACTTCC CGATGCTAAG 7860

TACTAACGTT TATAAAGATG GAAAACGCGC GTTTAAGCCT TCAACGATTG TAACAAAAAA 7920
(2) INFORMATION FOR SEQ ID NO: 2975:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 361 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2975:





GAGATCGCGG GTTCGATTCC CGTCGAGACC GTACAAATGC CTATCCAAGA GATAGTTTTG 60

TnTTGCGTTT AATATTATAT TAATAAAAGA TATATGGACG AATGATAATC ATATTGATTT 120

ATCTGTTCGT CCATTTTCTT TAAAATGTAT GAACCTCAAG TAACTTAGTG GTTGGATATG 180

AAAGATAAAC GTnGACAATA AAATCTTTAT TAGACGTACA AACATATGCT ACTGTCAACA 240

TATTTCTTCG TTGTGATATG CCACCAGTCC TCCATAACAT CAATTGTTAA AGTAACGAAT 300

AACGAATAAT GATATTTnAT TTCTGAGCCA TGACGTGCCA CTAGAAGTTG CCCATTATCC 360

T 361



(2) INFORMATION FOR SEQ ID NO: 2976: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2976: 

TGCTGCTTCT GAAGCGGAGT AATTGTGTAT ATCAAATGTC GCTTTTTGAT CAGGAACATG 60

ATGTAAAGGT GTATATGTTT CACCTGTTTG CTCATCATGT AACACGGCAT GCCTATGACT 120

GAATGTACCA CGTTCACTAT CTTGACCAGT TAAGCGAATC GGTGTACCAT CTTGTAAAAT 180

TGTCGCAAAT GCAAGTnGTT CTGCTTGTGC CCAATCAACT AAACCATCTT CTTTATTAAA 240

CGGCTCATGA CGCTTCTCAA GAACTTTGTT TAACTTTTm CAAAATGTTA AAGCCATCCG 300

GATATGTTAA CAATGCATCA TTATTTCTTT CAAGTGAtCC AAAAGTAAAG GnTTGT 356
(2) INFORMATION FOR SEQ ID NO: 2977: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
Arg Asp Gly Met Glu Val Cys Arg Glu Val Arg Lys Lys Tyr Glu Met
65 70 75 80

Pro Ile Ile Met Leu Thr Ala Lys Asp Ser Glu Ile Asp Lys Val Leu
85 90 95

Gly Leu Glu Leu Gly Ala Asp Asp Tyr Val Thr Lys Pro Phe Ser Thr
100 105 110

Arg Glu Leu Ile Ala Arg Val Lys Ala Asn Leu Arg Arg His Tyr Ser
115 120 125

Gln Pro Ala Gln Asp Thr Gly Asn Val Thr Asn Glu Ile Thr Ile Lys
130 135 140

Asp Ile Val Ile Tyr Pro Asp Ala Tyr Ser Ile Lys Lys Arg Gly Glu
145 150 155 160

Asp Ile Glu Leu Thr His Arg Glu Phe Glu Leu Phe His Tyr Leu Ser
165 170 175

Lys His Met Gly Gln Val Met Thr Arg Glu His Leu Leu Gln Thr Val
180 185 190

Trp Gly Tyr Asp Tyr Phe Gly Asp Val Arg Thr Val Asp Val Thr Ile
195 200 205

Arg Arg Leu Arg Glu Lys Ile Glu Asp Asp Pro Ser His Pro Glu Tyr
210 215 220

Ile Val Thr Arg Arg Gly Val Gly Tyr Phe Leu Gln Gln His Glu
225 230 235

(2) INFORMATION FOR SEQ ID NO: 5240:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5240: 

Xaa Leu Ser Thr Val Ile Gly Ala Xaa Leu Phe Phe Lys Ser Ser Val
1 5 10 15

Ser Leu Val Phe Lys Met Val Lys Lys Phe Arg Xaa Gly Val Ile Ser
20 25 30

Val Asn Asp Val Met Phe Ser Ser Ser Ile Met Tyr Arg Ile Lys Lys
35 40 45

Asn Ala Phe Ser Leu Thr Val Met Ala Ile Ile Ser Ala Ile Thr Val
50 55 60
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His Lys Ala 
1025 

(2) INFORMATION FOR SEQ ID NO:5255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5255: 

Gly Glu Lys Cys Met Phe Leu Ala Trp Asn Glu Ile Arg Arg Asn Lys
1 5 10 15

Leu Lys Phe Gly Leu Ile Ile Gly Val Leu Thr Met Ile Ser Tyr Leu
20 25 30

Leu Phe Leu Leu Ser Gly Leu Ala Asn Gly Leu Ile Asn Met Asn Lys
35 40 45

Glu Gly Ile Asp Lys Trp Gln Ala Asp Ala Ile Val Leu Asn Lys Asp
50 55 60

Ala Asn Gln Thr Val Gln Gln Ser Val Phe Asn Lys Lys Asp Ile Glu
65 70 75 80

Asn Lys Tyr Lys Lys Gln Ala Thr Leu Lys Gln Thr Gly Glu Ile Val
85 90 95

Ser Asn Gly His Gln Lys Asp Asn Val Leu Val Phe Gly Val Glu Lys
100 105 110

Ser Ser Phe Leu Val Pro Ser Leu Ile Glu Gly His Lys Ala Thr Lys
115 120 125

Asp Asn Glu Val Leu Ala Asp Glu Thr Leu Lys Asn Lys Gly Leu Lys
130 135 140

Leu Gly Asp Thr Leu Ser Leu Ser Xaa Xaa Arg
145 150 155



Claims 



1. Computer readable medium having recorded thereon a nucleotide sequence of the Staphylococcus aureus
genome as depicted in SEQ ID NOS:1-5,191, a representative fragment thereof or a nucleotide sequence at least
95% identical to a nucleotide sequence depicted in SEQ ID NOS:1-5,191.

2. Computer readable medium having recorded thereon any one of the fragments of SEQ ID NOS:1-5,191 depicted 
in Tables 2 and 3 or a degenerate variant thereof. 
of claim 12.

17. A method for regulating the expression of a nucleic acid molecule comprising the step of covalently attaching to 
said nucleic acid molecule a nucleic acid molecule consisting of the nucleotide sequence from about 30 to 300 
bases 5' to any one of the fragments of the Staphylococcus aureus genome depicted in SEQ ID NOS:1-5,191 and
Tables 2 and 3 or a nucleotide sequence at least 95% identical to such a nucleotide sequence or a degenerate 
variant of any of the aforementioned sequences. 

18. A nucleic acid molecule being a homolog of any of the fragments of the Staphylococcus aureus genome of SEQ
ID NOS:1-5,191 and Tables 2 and 3, wherein said nucleic acid molecule is produced by a process comprising the
steps of:

(a) screening a genomic DNA library using as a probe a target sequence defined by any of SEQ ID NOS: 
1-5,191 and Tables 2 and 3, including fragments thereof;

(b) identifying members of said library which contain sequences that hybridize to said target sequence; 

(c) isolating the nucleic acid molecules from said members identified in step (b). 

19. A DNA molecule being a homolog of any one of the fragments of the Staphylococcus aureus genome of SEQ ID
NOS:1-5,191 and Tables 2 and 3, wherein said nucleic acid molecule is produced by a process comprising the
steps of:

(a) isolating mRNA, DNA, or cDNA produced from an organism;

(b) amplifying nucleic acid molecules whose nucleotide sequence is homologous to amplification primers de- 
rived from said fragment of said Staphylococcus aureus genome to prime said amplification; 

(c) isolating said amplified sequences produced in step (b). 

20. A polypeptide encoded by a fragment of claim 8. 

21. An antibody which selectively binds to any one of the polypeptides of claim 20. 






22. A kit for analyzing samples for the presence of polynucleotides derived from Staphylococcus aureus, comprising
at least one polynucleotide containing a nucleotide sequence of any one of the fragments of SEQ ID NOS:1-5,191
depicted in Tables 2 and 3 or a nucleotide sequence at least 95% identical thereto or a degenerate variant of any
of the aforementioned sequences, that will hybridize to a Staphylococcus aureus polynucleotide under stringent
hybridization conditions, and a suitable container.

23. A Staphylococcus aureus polypeptide comprising an amino acid sequence identical to an amino acid sequence 
selected from the group consisting of SEQ ID NOS:5,192 to 5,255 or comprising an amino acid sequence having
at least 95% identity to such a sequence. 



24. A Staphylococcus aureus polypeptide antigen comprising at least one epitope derived from a Staphylococcus
aureus polypeptide selected from the group consisting of SEQ ID NOS:5,192 to 5,255.

25. A polypeptide comprising at least one epitope encoded by a Staphylococcus aureus amino acid sequence selected 
from the group consisting of the epitopic sequences listed in Table 4.

26. The polypeptide of claim 24 or 25, wherein said polypeptide is fixed to a solid phase.

27. A diagnostic kit for detecting Staphylococcus aureus infection comprising 

(a) an isolated polypeptide antigen of claim 24, and

(b) means for detecting the binding of an antibody contained in a biological fluid to said antigen. 

28. A vaccine composition comprising a polypeptide of claim 24 present in a pharmaceutically acceptable carrier. 

29. A method of vaccinating an individual against Staphylococcus aureus infection comprising administering to an
individual the vaccine composition of claim 28.
Figure 1

Figure 2